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This paper analyzes the appearance and distribution of noise on 
a spectrum obtained by a Fourier transformation of an int erf erogr am. 
which was the source of the spectrum and noise. 

This is accomplished through a discrete formulation by use of 
sampling theory and statistical methods. As a consequence of this 
formulation, a noise comparison of conventional spectrometers and 
interferometers is readily obtained which verifies that Fellgett’ s 
advantage does hold fdr interferometers with non-background limited 
detectors . C ’ 
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IHTRODUCTION 


Hie purpose of 'this paper is to analyze the statistical characteristics of 
noise existing on a spectrum when the spectrum has been obtained by means of a 
two-beam interferometer spectrometer . In this anal ysis the distribution of noise 
associated with a spectrum obtained by a Fourier transform of an interferogram 
will be derived. Also, it will be shown that the use of an interferometer spec- 
troneter has a •Ju signal- to-noise advantage over a conventional spectrometer 
when the detector used is non-background limited, M being the number of resolv- 
able spectral components . Ibis advantage in signal-to-noise ratio is commonly 
referred to as Fellgett’s advantage. 

THE 3UTERFERCX21AM AND ITS TRANSFORM 


A considerable number of papers have been written on the interferometer 

spectrometer ; consequently, the interferometer and its basic theory will be 
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discussed here only to the extent needed for orientation. 

Assume that an interferometer is illuminated by a monochromatic coherent 
source of constant intensity. Then, the interference intensity wave is 

l(t) = I (l+cos7ut) 

where u> = 2itf , f = c/A , 7 = 2v/c 

I is the light intensity, A is the wavelength of light, c is the velocity of 
light, and v is the velocity of the interferometer mirror. For heterochromatic 
radiation the interference wave becomes 



l(cj) [l+cos7wt]dw 


( 1 ) 
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This intensity wave is incident on a detector; thus, the output of the detector 
is proportional to (l) and is of the form 


pOO 

e(t) = c + / d(w)cos[7uJt-Hp(u)) jdw 


( 2 ) 


The right-hand term of ( 2 ) is then essentially an electrical anal og of the 




7 . Tha phase angles «p(w) will he zero, provided input intensity fluctuations 
do not occur, all system parameters are linear, and the proper time origin is 
established. The desired spectrum is obtained from the Fourier transform of the 
non d-c term of the interferogram Eq. ( 2 ) , and the transform may be expressed 
as 


poo p 00 

3i e(t)] = E(u>) = / e(t)cosu»tdt+j / e(t)sinutdt 

^ —00 ^ —CO 


(3) 


where the first and second integrals transform the even and odd parts of e(t) , 
respectively. In particular, if the phase angle cp(w) = 0 for all u> in ( 3 ) , 
then e(t) is an even function and the second integral in ( 3 ) is zero. 


DISCRETE FORMULATION 


In general, most spectra observed will be continuous. However, provided 
that sampling theory is correctly applied, the formulas of Fourier transformation 
spectroscopy may be expressed in a discrete form as well as the more familiar 
integral representation. For the purpose of this paper the discrete formulation 
has several advantages and will therefore be used in the following analysis. 

flampl i rtg theory shows 3 that if a function is time- limited to T seconds 
(i.e., the function is zero at all points outside an interval T seconds in 
duration) and contains negligible frequency components greater than B cycles 
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per second, then the function (and therefore its Fourier transform) may he 
completely determined (i.e., reconstructed) by 2BT samples taken in the time 
domain at intervals of l/2B seconds 4 or in the frequency domain at intervals of 
l/T cycles per second from -B to +B. 

If sampling theory is used then one may ■write Eq. (l) as 

M 

l(t) = Ii(l+cos7u)jt) (M 

i=i 


where 

Equation (2) becomes 


uq = 2itfi and ft = cfh± 


M 

e(t) = c + y^djcos(7Mjt+cPi) 


i=i 


and (3) may be expressed as 


BT BT 

E^) = Ati ^e^^jcosu^ti+jAti ^e^Jsinu-^ti 


i=-BT 


i=-BT 


(5) 


( 6 ) 


where 


cfc = 2flk/T, k = ±1, ±2, •••, ±BT, Ati = l/2B, t± = i/2B 


and 



t=i/2B 


Several observations should be made at this point with regard to the 
generality of (4) and (5) • First, owing to physical limitations of the instrument, 
(4) holds only for a limited observation time T. Hence, the true detector signal 
is not (5) but is ej(t), where 


1 


ej(t) = e(t)h(t) with h(t) = 


-T/2 < t < T/2 


0 


elsewhere 


(7) 


and e(t), is given by ( 5 ). However, the Fourier transform of the ith term 
of the truncated vave e^Ct) is 

| ~sin 3 tT(f- 7 fj) sinflT(f+ 7 fj) " 

1 L «(f-7fi) *(f+7fi) - 

compared with the transform of the corresponding term of e(t) which is 

di [8(f-7fi) + Btf+rP-t 

Furthermore, the amplitude components, di, of Eq. ( 5 ) are proportional to the 
amplitudes of the magnitude of the Fourier transform at the sampling points 
= 2J&/T, that is, 

ai-2/l|le T (t))|^ 2 „ l/T (8) 

where the factor 2/T appears because Eq. ( 5 ) is in the form of a Fourier sin-cos 
series and there is a difference of a factor of 2/T in the definition of the 
series coefficients and of this transform. 

Secondly, the generality of representing the interference wave (l) by (4) 
using a discrete sum of M terms to cover the continuum of frequencies f x to fj^ 
needs to be considered. Essentially, the mechanism of the interference process 
and its detection is one of compressing and translating the spectrum of optical 
frequencies into a wave, e T (t), with a spectrum of frequencies, say, from 
0 to B cps, which may be in the audio-frequency range. This is accomplished by 
mapping the intensity pattern of the optical interferogram into its electrical 
representation eg(t) . The point which requires examination then is, how good 
will the representation of the desired optical spectrum be if it is obtained from 
the Fourier transform of eg(t) , which is both time-limited (T sec), approximately 
band- l im i t ed (B cps), and represented by a discrete number of independent terms. 
The answer to this question will be given later in this paper, where it is shown 
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that the number of resolvable components M, required to represent the true 
optical spectrum of a continuum of frequencies f x to f^ is BT. Since the 
signal e.ji(t) may be completely represented by 2BT independent data, according 
to sampling theory, the summations of (4) and ( 5 ) completely describe the desired 
optical spectrum within the resolution determined by eji(t) when the source under 
observation consists of a continuum of frequencies. It should be noted that BT 
independent amplitude data and BT independent phase data are required to com- 
pletely specify ej(t) , but the phase information is deleted in the computation 
of the magnitude of the Fourier transform. 

TRANSFORMATION OF ADDITIVE NORMAL IMERFEROGRAM NOISE 

As indicated in the previous section, an interferogram is a time record 
whose Fourier transform gives the spectral data of the observed source. Any 
noise in the t ime record is redistributed when the Fourier transformation is 
performed. There has been some confusion as to how such a transformation affects 
the noise in the spectrum, as compared with a conventional spectrometer. In the 
following section it will be shown that, although the noise is redistributed in 
a different form with different statistical parameters, the signal-to- noise ratio 
per spectral element of a spectrum derived from an interferogram does exhibit 
Fellgett ' s advantage. 

To determine the effect on the spectrum of noise appearing on the time 
function, first consider the noise probability density function and how it trans- 
forms. In doing this, it will be necessary to assume that the transformation 
process does not introduce additional noise power. This assumption is reasonable 
in practice, since discrete formulation is possible by sampling theory, thus 
allowing the operation to be performed by a digital computer program. 
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C Q 

Because of the detector noise ’ , every amplitude sample of the interf er ogr am 
e^(t) will he measured with seme uncertainty, say for the ith sample. 

Then the resultant interf er ogr am will he 

e T (t) = e T '(t)4€ T (t) (9) 

where e^Ct) is the true signal and e(t) is the noise, or in a sampled form 

ei * ei‘+€i, i = 0, ±1, ±2, ♦ • • , ±BT (10) 

Assume that the randan noise €fp(t ) belongs to a normal prohahility distrihution 
with zero mean. This statistical characterization will include all random nor- 
mally distributed noise that is additively superimposed on the true int er f er ogr am 
and whose samples are statistically independent. 

The prohahility density function for any error is given by 

P( e )de = exp de (ll) 

(2xa 2 ) 1,z 

where o 2 = variance of 6j_ . It can he verified that the variance, a 2 , is 
equal to the average power of the anplitudes of e(t) . Thus, Eq. (ll) expresses 
the statistics of the time dcmain noise, e (t ) , on the interferogram e^(t). 

Now that the time domain noise to he considered has been formulated, the goal 
will he to statistically characterize this noise in the transformed spectrum. 

€rp(t) expressed in terms of its sample form may he transformed by (6) which 
is far u = 

“(w)+j3(u) 

BT 

- s I ^§Ml|) 

n=-BT 


where 


and 


BT 

pm - ± 


nu 

2B 


n=-BT 


= 0 


•l> 


2*B 


For a given frequency e(u>) is a weighted sum of all the so that hy the use 

of the central limit theorem 7 it can he 6hown for large BT that the random 
variables a and B are independent and the probability distributions of the 
amplitudes of a(w) and p(u) are represented by 


P(a) = 


(p 2 *) 


1/2 


exp 


fa 2 ! 

1p 2 J 


and 


p(|3) ^175 e: ® 


(13) 


(p 2 *) 


for 


where 


< 


0) < 2*B 


pa = <|.K)|‘> 


Since erj(t) was assumed to be random noise, the e± are statistically independent 
and consequently p 2 is not a function of frequency. The relation between the 
variance of the time samples and the variance of the frequency samples is given 


as* 




(U0 


Furthermore, the joint distribution of a and P is the bivariate normal distri- 
bution 


P(a,P)dotdP = -i- exp - i a 2 + P 2 i dadB 

psjt L p 2 J 


( 15 ) 
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These expressions describe the statistical behavior of noise alone irrespective 
of the true signal, but these formulations readily permit the characterization 
of the transformed noise superimposed on the true spectrum. 

From (9) the Fourier transform of the received time record erp(t) is 

'^£erp(t)] ss^terpCt)] +^[&j>(t)] (l( 

or E(w) = E?(u)) + e(u) 

where = a(w) + Jb(u) 

and e(w) = a(u) + jP(a>) 

From (8) and (l6) the spectrum of intensity as a function of frequency becomes 
for the general case 


=| [a(u)-wt(w)]+j[b(w)+3(w)] 


The probability distribution of the variable 


1 2E(u>) 


, where the transform of the 


noise is described according to ( 15 ) will be representative of the way the noise 
is distributed over the spectrum. 

For each sample, aCu^) and bCw^) are fixed, and can therefore be 

considered by means of the variables 

x = — (a+a) 

T 


y = fft+P) 


From (15) P(x,y)dxdy = expjg^ (x - ■§&) + (y - 


where 
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Therefore, from (l4) 


(19) 


TkZ = -i-a 2 
BT 


1/2 


2E(u?) 


T 


and 


The variables in (l8) may be changed so that r = (x^y 2 ) 

9 — tan \/x.. When this is done then it may be readily shown that the desired 
distribution p(r)dr is Rician. 8 That is. 
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where n = (a 2 tb 2 ) and I 0 (z) = ) (tH / (ml) a is the zero-order hyperbolic 

n*=o / 

Bessel function. An alternate form of I 0 (z), useful for large values of the 
argument, is 

,z 


Io(z) ’ ( 1 + & + iJ? + '••) 


(21) 


(2«z) 


The properties of the density function (20) may be described by its mome nts 

B 


which are given by 


= (2 if 2 ) r(f+l) exp 


Ji 
24F 2 . 


( 22 ) 


where M is a confluent hypergeametric function. 
In particular , 

^r 2 ^ = n 2 + 2^ 

and for no signal present (only noise) 


< r > =*(f/ 


.1/2 


(23) 


(2b) 


and 


<j 2 ^ = 2\j/ 2 
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Expression (20) represents the probability distribution of the amplitudes 
of the spectra •which include the true spectrum and, the super imposed noise. The 
particular shape of the distribution for any amplitude (any sample in the frequency 


domain) is dependent upon the magnitude of the true si gna l. That is , 

> 1/2 


P = -| |[a(w)] 2 + [b(w)3 2 ^ 


(25) 


Figure 1 shows in normalized farm tP(r) vs. r/t far several values of 


n/t. 


<Fig . 1 


Far purpose of interpretation (20) can be closely approximated in two 
particular cases: First far p = 0 and far small p over the range of 

r « then 



and 


P(r)dr 3 -^expjglj dr 


(26) 


This is the Rayleigh distribution presented in normalized form in Fig. 1 with 
p/\|r = 0# The significant point is that the distribution for zero signal is not 
centered about zero. In fact the peak far the Rayleigh distribution occurs at 
r = +. The first and second moments are given by (24) as is shown in Fig. 1 by 
the curve for p/t = 0. 

Consider the second case where the amplitude of the true spectrum is large 

compared to f, that is, p » lfr. For large z from the series form of (21) 

1 /2 

Io(z) z e z /( 2«z) , so that (20) can he approximated by 


P(r)dr 



dr 



( 27 ) 
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which for r z p further closely approximates a normal distribution 


P(r)dr » 


( 2 ^) 


1 /2 




(28) 


It should he pointed out that if the conditions are met so that d(w) =0 
in (2) for all w, then only the cosine transform need he computed to determine 
the spectrum. The noise distribution on the computed cosine transform is then 
given by the first equation of (13) , and the spectrum, as a function of frequency, 
Is 

r i _ _ i= [a(co)+a(u))] 

T T 

The probability density function of r' bee canes from (13) 


(29) 


NOISE COMPARISON 


Equation (20) and its approximation (27) characterize the statistics of the 
transform of the interferogram which includes signal plus normal (Gaussian) noise. 
Now a comparison of interferometers and conventional spectrometers with respect 
to errors caused by normal (Gaussian) noise can be made. The only noise which 
will be considered is that which originates in the detector. 

Detector noise is gener ally ascribed to two sources; that which is intrinsic 
in the detector itself and that which is generated in the detector by the statis- 
tical fluctuations of the incident photons. Detectors whose intrinsic noise is 
less than that due to incident photon fluctuation are known either as photon 
limited or background limited detectors. In the case of the conventional spectrom- 
eter the signal-to-noise ratio is not a function of the source of the detector 
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noise . This is not the case , however , for the interferometer spectrometer . If 


the interferometer detector is background limited, the detector sees simultane- 
ously the total radiation from all of the spectral elements while the detector 
of the conventional spectrometer is affected only by the radiation from one 
spectral element. Thus , the interferometer detector noise is M times greater. 
Only the non-background limited detectors will be considered at this time, with 
the background limited detectors being treated later. The former class of 
detectors is both large and important , being as all detectors used in the 
infrared spectral region are considered to be non-background limited. 

To compare the noise errors of a conventional and interferometer spectroanr 
eter it is necessary to assume the same scan time, T, for the conventional 
spectrometer as far the interferometer and the same number of resolved spectral 
elements M, where M = BT. Then the time allowed for the measurement of each 
resolved spectral element is l/B and the required bandpass is approximately 
B cycles per second. Therefore, the noise can be limited to a bandwidth of B 
and the probability distribution of the noise is characterized by the same o 2 
as used in Eq. (ll) . Thus, the amplitude error per spectral element for conven- 
tional case obeys the statistics expressed by (ll) . Also, one may write in an 
analogous manner to (10) Si = Si + £j_ where Si is the true signal amplitude 
and €i is the superimposed noise error. Hence, for the statistical represen- 
tation of each sample (amplitude) Si, the true signal Si’ = Mi is fixed and 
can be considered as the mean so that the probability density function of the 
amplitudes is 

P(S)dS i-jys expj- ^a(S-^) 2 ] dS (30) 

(2JI0 2 ) 
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By way of comparison of amplitude errors for the interferometer spectrometer 
and the convention a l spectrometers, note the results (20) and (30) and in 
particula r (28) an d (30) for large signal values. The variance indicates the 
spread of the distribution and the smaller the variance the more peaked the 
distribution becomes. From (l8) 


f n "\ _ 

^ = VBTy^ 
\ / 


Hence, it can be concluded that the amplitude noise error per spectral element 
for the interferometer belongs, with a given probability, to a sma lle r amplitude 
range. That is, the time domain amplitude errors are redistributed into amplitude 
errors in the frequency domain where the most probable amplitude range has been 
decreased. This is illustrated in Fig. 2 , where arbitrarily BT was chosen < 
to be 100 . Where it has yet to be shown that BT is the number of resolvable 
components in the spectrum. 

On the basis of these results, the noise powers per spectral elements for 
the conventional spectrometers and the interferometer can be computed and 
compared. Any computed spectral amplitude is of the form 



r = § :?-[e T (t)] 


,1/2 


and the corresponding true amplitude is 

P = | |[a(w) 3 2 + [b(w)] 2 j- 
The mean square error <^(r - p) 2 ^> is then the average noise power, Nj 2 , 


which may be expressed as 


= <^(r - p) 2 ^> = <^r 2 ^> -2 <^r^> p + p 2 


Ik 



using Eq. (23) , 


Nj 2 = 2*® + 2 m^i - ^ (31) 

Tha mean of r , <^r ^ , has been computed from ( 22 ) and Nj 2 evaluated as 


<Fig 


a function of true signal, p. The results are plotted in Fig. 3 * It is seen ^ Fig. 3 
that the noise power is a function of the signal and that the maximum noise power, 

Nlrnax’ occ ' ur6 a t U = 0 and in terms of variance 

O 2^ 


The minimum noise power occurs where p/V = 1, in which case N j^ ^ = O.87 if 2 - 
Furthermore, for larger u/t, Nj 2 quickly approaches if 2 -. Relating t 2 to the 
variance (or average noise power) of the tine amplitude noise by (l 8 ) gives 



(32) 


For the special case where only the cosine transform describes the spectrum, 
the probability density function was expressed by (29) • Since (29) is a Gaussian 
(normal) distribution, the average noise power per spectral element is i / 2 which 
is clearly independent of the signal a(w) . Hence (32) is the average noise 
power for the interferometer for this case. 

In the time record, the amplitude noise power was o 2 . Since the same 
additive noise occurs directly on the spectrum of the conventional case, then 
is the average noise power per spectral element assuming the same scan time 
T and bandwidth B in both cases. 

It is desired now to prove that BT is equal to M, where M is the 
number of spectral components that an interferometer is able to resolve. Suppose 
that a source emits energy of two frequencies f x and f 2 . The interferometer 
produces an output whose Fourier transform is of the form 
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3t«r(t)] - 1 *1 

2 «T(f-7fi) 


+ ^ da 


slmtT(f-7f 2 ) 

«T(f-7f 2 ) 


(showing only the positive frequency terns) . According to the Rayleigh 
criterion, the two spectral lines are independent and resolvable when the maximum 
of one coincides with a first zero of the other. In the above example, then, the 
frequencies f x and f 2 are resolvable if 6f = yj f x - f 2 j = -i. If the spectral 
range of the source is from f a to fj, and the resolution Is 5f , then the 
number of distinguishable independent spectral elements is M «= (f^ - f a )/6f. 

In the frequency mapping of the interferometer fj, - f a = B and 6f * i so that 
M = BT. Th u s, from (32) the noise voltage for an interferometer is 

111 -jt (33) 

From this it may be seen that the noise voltage of an interferometer is l/JR 
times the noise voltage far a spectrometer. Since the signal levels are the 
same for both, this results in a Jm signal-to- noise advantage far the Inter- 
ferometer, which has been called Fellgett's advantage. This advantage holds 
for non-background limited detectors. If the detector Is background limited, 
then the mean rate of photons ^ I x ^ incident on the detector, and therefore 
the noise power, is M times as great for the interferometer as for the conven- 
tional spectrometer. Thus , ( 33 ) , for interferometer noise voltage, becomes 
Nj = a and Fellgett ’ s advantage is cancelled. 


SUMMARY 


The probability distributions which govern the noise superimposed on the 
signal far both the interferometer and conventional spectrometer are derived 
and compared. For interferometers, the variance, and hence the spread of error 
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amplitudes per spectral element , is less than for the same time observation and 
same resolution for the conventional case. Furthermore, the noise power in the 
spectrum is shown to be dependent on the signal, and varies from 2o 2 /M to 
approximately o^/M and hence the signal-to-noise voltage ratio comparisons 
show an advantage of from n/M/ 2 to approximately *Jm for the interferometer. 
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FIGURE LEGENDS 


Fig. 1.- Nor m al i zed probability distribution curves of the amplitude of interfer- 
ometer spectra which includes the true spectrum and the superimposed noise for 
different values of the true signal ju/ijr , This demonstrates the dependence of 
the distribution on the amplitude of the true signal. Note: For zero signal, 
the distribution (Rayleigh) has its peak at r = t- 
Fig. 2.- Comparison of noise distribution curves for conventional and interfer- 
ometer spectrometers. Curves (a) and (b) are for a conventional spectrometer 
with P(S) platted against S, a variance of o 2 and u = 0 and 100, respec- 
tively. Curves (c) and (d) are for an interferometer spectrometer with P(r) 
plotted against r, BT = 100 and p = 0 and 100. 

Fig. 3 •- Noise power curve showing the dependence of noise power on the true signal. 
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2. Comparison of noise distribution curves for conventional and interferometer s])ectrometers. 

Curves (a) and (b) are for a conventional spectrometer with P(S) plotted against S, a variance of 
a 2 and |i = 0 and 100, respectively. Curves (c) and (d) are for an interferometer spectrometer with 
P(r) plotted against r, BT = 100 and p = 0 and 100. 



